issue/471: add AWQ dequantize in moore gpu, with python test pass and 9G4B-AWQ inference test pass #485

Merged
PanZezhong1725 merged 1 commit into main from issue/471 on Sep 29, 2025
Conversation

@spike-zhu (Contributor) commented Sep 29, 2025

AWQ quantization support for the Moore platform: the AWQ dequantize operator is implemented as a MUSA C++ kernel.

Python unit test:
image

9G4B quantized inference test:
image
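
For readers unfamiliar with the operator, here is a minimal pure-Python sketch of what 4-bit AWQ dequantization computes: each packed 4-bit weight `q` is mapped to `(q - zero) * scale`, with zeros and scales shared per group of input rows. This is not the MUSA kernel from this PR. The function names, the list-of-lists layout, and the lowest-nibble-first unpacking order are assumptions made for illustration; real AWQ checkpoints may interleave the nibbles differently, and the kernel's file name suggests it produces FP16 rather than Python floats.

```python
def unpack_int4(word):
    """Split one 32-bit word into eight unsigned 4-bit values.

    Assumption: the lowest nibble comes first (sequential order).
    """
    return [(word >> (4 * i)) & 0xF for i in range(8)]


def unpack_row(words):
    """Unpack a row of 32-bit words into a flat list of 4-bit values."""
    return [v for word in words for v in unpack_int4(word)]


def dequantize_awq(qweight, qzeros, scales, group_size):
    """Hypothetical reference: w = (q - zero) * scale.

    qweight: [rows][cols / 8] packed 32-bit ints.
    qzeros:  [rows / group_size][cols / 8] packed 32-bit ints.
    scales:  [rows / group_size][cols] floats.
    """
    out = []
    for r, packed_row in enumerate(qweight):
        g = r // group_size  # group that this input row belongs to
        q = unpack_row(packed_row)
        z = unpack_row(qzeros[g])
        out.append([(qi - zi) * si for qi, zi, si in zip(q, z, scales[g])])
    return out


# Two rows in one group, zero point 8 everywhere, scale 0.5 everywhere.
weights = dequantize_awq(
    qweight=[[0x76543210], [0xFEDCBA98]],
    qzeros=[[0x88888888]],
    scales=[[0.5] * 8],
    group_size=2,
)
```

A reference of this kind is what a unit test would typically compare the device kernel's output against, element by element.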

@spike-zhu changed the title from "issue: add AWQ dequantize in moore gpu, with test pass" to "issue/471: add AWQ dequantize in moore gpu, with test pass" on Sep 29, 2025
@spike-zhu self-assigned this on Sep 29, 2025
Comment thread on src/infiniop/ops/dequantize_awq/moore/dequantize_w42f16_kernel.h (outdated)
@spike-zhu changed the title from "issue/471: add AWQ dequantize in moore gpu, with test pass" to "issue/471: add AWQ dequantize in moore gpu, with python test pass and 9G4B inference test pass" on Sep 29, 2025
@spike-zhu changed the title from "issue/471: add AWQ dequantize in moore gpu, with python test pass and 9G4B inference test pass" to "issue/471: add AWQ dequantize in moore gpu, with python test pass and 9G4B-AWQ inference test pass" on Sep 29, 2025
@PanZezhong1725 merged commit fbfb0ef into main on Sep 29, 2025
8 checks passed
@PanZezhong1725 deleted the issue/471 branch on September 29, 2025 09:19
2 participants